A Lexicon Driven Approach for Off-line Recognition of Unconstrained Handwritten Korean Words
نویسندگان
چکیده
We propose a new method for the recognition of unconstrained handwritten words consisting of Korean and numeric characters. To overcome the difficulty in separating touching characters, we adopt an oversegmentation technique and we find the optimal segment combination using a lexicon-driven word scoring technique and a nearest neighbor classifier. The optimal combination gives the final segmentation positions for individual characters with the best matching word in the lexicon. The proposed system has yielded an accuracy of 90.64% for 908 word images on live mail pieces.
منابع مشابه
A lexicon-driven approach for optimal segment combination in off-line recognition of unconstrained handwritten Korean words
We propose a new method for o!-line recognition of unconstrained handwritten words consisting of Korean and numeric characters. To overcome the di$culty in separating touching characters, we adopt an over-segmentation strategy. Given a slice of the input word image, we "nd the optimal segment combination using a lexicon-driven word scoring technique and a nearest-neighbor classi"er. The optimal...
متن کاملAn HMM-Based Approach for Off-Line Unconstrained Handwritten Word Modeling and Recognition
ÐThis paper describes a hidden Markov model-based approach designed to recognize off-line unconstrained handwritten words for large vocabularies. After preprocessing, a word image is segmented into letters or pseudoletters and represented by two feature sequences of equal length, each consisting of an alternating sequence of shape-symbols and segmentationsymbols, which are both explicitly model...
متن کاملیک روش دو مرحلهای برای بازشناسی کلمات دستنوشته فارسی به کمک بلوکبندی تطبیقی گرادیان تصویر
This paper presented a two step method for offline handwritten Farsi word recognition. In first step, in order to improve the recognition accuracy and speed, an algorithm proposed for initial eliminating lexicon entries unlikely to match the input image. For lexicon reduction, the words of lexicon are clustered using ISOCLUS and Hierarchal clustering algorithm. Clustering is based on the featur...
متن کاملA Lexicon Driven Method for Unconstrained Bangla Handwritten Word Recognition
In this paper a lexicon driven segmentationrecognition scheme for unconstrained Bangla handwritten word recognition is proposed for Indian postal automation. In the proposed method, at first, binarization of the input document is done and slant correction of the individual words is performed. Next, using water reservoir concept words are pre-segmented into possible primitive components (charact...
متن کاملTurkish handwritten text recognition: a case of agglutinative languages
We describe a system for recognizing unconstrained Turkish handwritten text. Turkish has agglutinative morphology and theoretically an infinite number of words that can be generated by adding more suffixes to the word. This makes lexicon-based recognition approaches, where the most likely word is selected among all the alternatives in a lexicon, unsuitable for Turkish. We describe our approach ...
متن کامل